Fast and Scalable Real-Time Monitoring System for Beowulf Clusters

Authors

  • Putchong Uthayopas
  • Sugree Phatanapherom
Abstract

Fast real-time monitoring of system information is important for understanding parallel systems, especially the large cluster systems that have appeared recently. Making a monitoring system both fast and scalable remains a challenging task. This paper presents the design and implementation of a fast real-time monitoring system called SCMS/RMS, which is part of a more comprehensive cluster management tool called SCMS. SCMS/RMS is designed to be flexible, highly scalable, and efficient. The paper describes the techniques used to increase monitoring speed and achieve high scalability. Experiments conducted on a 72-node Beowulf cluster show that SCMS/RMS is very fast and highly scalable.

Similar resources

Scalable parallel FFT for spectral simulations on a Beowulf cluster

The implementation and performance of the multidimensional Fast Fourier Transform on a distributed-memory Beowulf cluster are examined. We focus on the three-dimensional (3D) real transform, an essential computational component of Galerkin and pseudo-spectral codes. The approach studied is a one-dimensional domain decomposition algorithm that relies on communication-intensive transpose opera...

Beowulf – A New Hope for Parallel Computing?

The Beowulf model for clusters of commodity computers[15, 17, 18] has become very popular over the last year, particularly amongst university research groups and other organisations less able to justify large procurements. The Beowulf concept is usually applied[16] to clusters of Personal Computers running Linux, but other platforms and operating systems can also be considered as providing simi...

ACL2 for Parallel Systems Software: A Progress Report

A significant development in high-performance computing has occurred in recent years with the proliferation of “Beowulf” clusters [6]. Beowulf clusters are parallel computers assembled from commodity-priced personal computers and networks. The explosive growth of the personal computer marketplace, together with rapid technological advances in the hardware sold there, has driven the price/perfor...

An IP-level Network Monitor and Scheduling System for Clusters

Current systems for managing workload on clusters of workstations, particularly those available for Linux-based (Beowulf) clusters, are typically based on traditional process-based, coarse-grained parallel and distributed programming. The DESPOT project is building a sophisticated thread-level resource-monitoring system for computational, storage and network resources based on SGI’s Performance...

Optimizing Latency in Beowulf Clusters

This paper discusses how to decrease and stabilize network latency in a Beowulf system. Having low latency is particularly important to reduce execution time of High Performance Computing applications. Optimization opportunities are identified and analyzed over the different system components that are integrated in compute nodes, including device drivers, operating system services and kernel pa...

Publication date: 2001